SWEEPING WORDS AND THE LENGTH OF A
GENERIC VECTOR SUBSPACE OF $M_n(\mathbb{F})$

IGOR KLEP AND ŠPELA ŠPENKO


Abstract. The main result of this short note is a generic version of Paz's conjecture on the
lengths of generating sets in matrix algebras. Consider a generic $g$-tuple $A = (A_1,\dots,A_g)$
of $n \times n$ matrices over a field. We show that whenever $g^{2d} \ge n^2$, the set of all words of
degree $2d$ in $A$ spans the full $n \times n$ matrix algebra. Our proofs use generic matrices, are
combinatorial and depend on the construction of a special kind of directed multigraphs with
few edge-disjoint walks.


1. Introduction 

Let $\mathbb{F}$ be a field and let $\mathcal{A}$ be an associative $\mathbb{F}$-algebra. Given a generating set $S$ of $\mathcal{A}$
with $1 \in S$, let $S^{\ell}$ denote the set of all products of the form $s_1 \cdots s_{\ell}$ with $s_i \in S$. If

\[ \operatorname{span} S^{\ell-1} \subsetneq \operatorname{span} S^{\ell} = \mathcal{A}, \]

we say that $S$ has (generating) length $\ell$. These lengths feature prominently in the study of
growth of algebras and the Gelfand-Kirillov dimension [KL00, BoKr76].

A fundamental problem is to find bounds on the length of generating sets. Much activity 
has focused on $\mathcal{A} = M_n(\mathbb{F})$ (see e.g. [Paz84, FGG97, Pap97, LR11, Ros12]), where the best
known bound on the length of generating sets is $O(n^{3/2})$, due to Pappacena [Pap97]. The Paz
conjecture [Paz84] states that the bound is $2n - 2$, and it is easy to see that this bound would
be sharp. We refer the reader to [LS09] for the study of an analogous problem in groups.

In this short note we establish a strengthening of the Paz conjecture in a generic setting: 
the length of a generic generating set in $M_n(\mathbb{F})$ is $O(\log n)$ (Corollary 2.3). To prove this
bound we establish the existence of sweeping words $w_1,\dots,w_{n^2}$ of degree $2\lceil \log_g n \rceil$ in $g$ freely
noncommuting letters $x_1,\dots,x_g$. That is, there exist (symmetric) $n \times n$ matrices $A_1,\dots,A_g$
such that $w_1(A_1,\dots,A_g),\dots,w_{n^2}(A_1,\dots,A_g)$ span $M_n(\mathbb{F})$; see Theorem 2.2. Here is a simple
corollary. Given $g^{2d} \ge n^2$, consider the set of all words $w$ of length $2d$ in $g$ matrices $A \in M_n(\mathbb{F})^g$.
Vectorize each matrix $w(A)$ and arrange these vectors into a matrix (of size $n^2 \times g^{2d}$). Corollary
2.4 shows that this matrix is generically of full rank, generalizing a theorem of Rosenthal
[Ros12] who established the special case $d = 1$.

The key step in the proofs is the construction of a special kind of directed multigraphs with
few edge-disjoint walks (Subsection 3.1). Another ingredient of our proofs is the theory of generic
matrices and their properties [Row80, GRZ03].

The paper is organized as follows. Section 2 gives notation, preliminaries, and presents our 
main results on sweeping words and lengths in matrix algebras. Section 3 gives proofs of our 
results, including the graph-theoretic construction in Subsection 3.1. 

Acknowledgments. The authors thank Benoît Collins for several stimulating discussions,
Jason P. Bell for sharing his expertise, and Jurij Volčič for carefully reading a preliminary
version of the manuscript.


2. Main results 


2.1. Notation and preliminaries. Let $\mathbb{F}$ denote an infinite field and $M_n(\mathbb{F})$ the algebra of
$n \times n$ matrices over $\mathbb{F}$. We denote the free associative algebra generated by $x_1,\dots,x_g$ by
$\mathbb{F}\langle x_1,\dots,x_g\rangle$. By $\langle x_1,\dots,x_g\rangle$ we denote the free monoid generated by $x_1,\dots,x_g$, and by
$\langle x_1,\dots,x_g\rangle_d$ the words in $\langle x_1,\dots,x_g\rangle$ of length $d$. In case $g = 2$ we write $x, y$ instead of $x_1, x_2$.
The set $\{1,\dots,d\}$ is denoted by $\mathbb{N}_d$.


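2.1.1. Generic matrices and the discriminant. We denote by
$C = \mathbb{F}[x_{ij}^{(k)} \mid 1 \le i,j \le n,\ 1 \le k \le g]$ a commutative polynomial algebra. The elements
$X_k = (x_{ij}^{(k)}) \in M_n(C)$, $1 \le k \le g$, are called generic matrices. The discriminant
$\Delta(A_1,\dots,A_{n^2})$ of $n \times n$ matrices $A_1,\dots,A_{n^2}$ is the determinant of the $n^2 \times n^2$ matrix
whose $k$-th column is the vectorized matrix $A_k$, i.e., the column
$\big((A_k)_{11},(A_k)_{12},\dots,(A_k)_{nn}\big)^t$.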




2.1.2. Locally linearly independent words. We say that $w_1,\dots,w_m \in \langle x_1,\dots,x_g\rangle$ are $M_n(\mathbb{F})$-locally
linearly independent if $w_1(A),\dots,w_m(A)$ are linearly independent for some
$A \in M_n(\mathbb{F})^g$. This concept first appeared in [CHSY03]; later it was studied algebraically in
[BrKl13], and recently in [BPS15].

Since we assume that F is infinite we have the following easy observation. 

Lemma 2.1. Words $w_1,\dots,w_{n^2} \in \langle x_1,\dots,x_g\rangle$ are $M_n(\mathbb{F})$-locally linearly independent if and
only if the discriminant of $w_1(X_1,\dots,X_g),\dots,w_{n^2}(X_1,\dots,X_g)$ is nonzero.
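
As a small illustration of Lemma 2.1 (a worked instance of our own choosing, assuming the row-by-row vectorization), take $n = 2$, $g = 2$ and the four words $x^2, xy, yx, y^2$. Evaluating at one specific pair of symmetric matrices already gives a nonzero determinant, so the discriminant at the generic matrices is a nonzero polynomial:

```latex
\[
A=\begin{pmatrix}1&0\\0&0\end{pmatrix},\quad
B=\begin{pmatrix}0&1\\1&0\end{pmatrix},\qquad
A^2=e_{11},\quad AB=e_{12},\quad BA=e_{21},\quad B^2=I_2,
\]
\[
\Delta(A^2,AB,BA,B^2)=\det\begin{pmatrix}
1&0&0&1\\
0&1&0&0\\
0&0&1&0\\
0&0&0&1
\end{pmatrix}=1\neq 0 .
\]
```

Hence $x^2, xy, yx, y^2$ are $M_2(\mathbb{F})$-locally linearly independent, in line with the case $n = 2$, $d = 1$ of the results below.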

2.2. Main result on words.


Theorem 2.2. Let $g \ge 2$ and $d = \lceil \log_g n \rceil$. Then there exist $M_n(\mathbb{F})$-locally linearly independent
words $w_1,\dots,w_{n^2} \in \langle x_1,\dots,x_g\rangle_{2d}$. That is, for some $A \in M_n(\mathbb{F})^g$ the matrices
$w_1(A),\dots,w_{n^2}(A)$ are linearly independent and thus span $M_n(\mathbb{F})$.

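2.2.1. Length of a vector space. Let $\mathcal{V}$ be a vector subspace of $M_n(\mathbb{F})$. By $\mathcal{V}^k$ we denote the
vector space spanned by the words of length at most $k$ evaluated at $\mathcal{V}$. The length of $\mathcal{V}$ is
the integer $\ell$ yielding a stationary chain

\[ \mathcal{V} \subsetneq \mathcal{V}^2 \subsetneq \cdots \subsetneq \mathcal{V}^{\ell} = \mathcal{V}^{\ell+1}. \]

We say that words $w_1,\dots,w_t \in \langle x_1,\dots,x_g\rangle$ sweep $M_n(\mathbb{F})$ if there exists $A \in M_n(\mathbb{F})^g$ such
that $w_1(A),\dots,w_t(A)$ span $M_n(\mathbb{F})$. Given a subset $S$ of $M_n(\mathbb{F})^g$, a vector space $\mathcal{V} \subseteq M_n(\mathbb{F})$ of
dimension $g$ is $S$-generic if it can be spanned by elements $A_1,\dots,A_g$ satisfying $(A_1,\dots,A_g) \in S$.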
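
The stationary chain above can be computed directly for small examples. The following is a minimal sketch in exact rational arithmetic, assuming words of length at least 1 (no empty word) and using the observation that the $(k+1)$-st term of the chain is spanned by the $k$-th term together with its products with the generators. The names `mat_mul`, `try_insert` and `length_of_subspace` are hypothetical helpers, not notation from the text.

```python
from fractions import Fraction

def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def try_insert(basis, matrix):
    """Reduce the vectorized matrix against an echelon basis.

    Appends the remainder and returns True if it is nonzero,
    i.e. if the matrix enlarges the span; otherwise returns False.
    """
    vec = [Fraction(x) for row in matrix for x in row]
    for b in basis:
        pivot = next(i for i, x in enumerate(b) if x != 0)
        if vec[pivot] != 0:
            c = vec[pivot] / b[pivot]
            vec = [v - c * w for v, w in zip(vec, b)]
    if any(x != 0 for x in vec):
        basis.append(vec)
        return True
    return False

def length_of_subspace(generators):
    """Return (length, dims) where dims[k-1] is the dimension of the span
    of all words of length 1..k in the generators (assumption: the empty
    word is not included)."""
    basis = []
    spanning = [a for a in generators if try_insert(basis, a)]
    dims = [len(basis)]
    while True:
        # The next term of the chain is spanned by the current one
        # together with (current spanning set) * (generators).
        products = [mat_mul(s, a) for s in spanning for a in generators]
        new = [p for p in products if try_insert(basis, p)]
        if not new:
            return len(dims), dims
        spanning.extend(new)
        dims.append(len(basis))

if __name__ == "__main__":
    e11 = [[1, 0], [0, 0]]
    b = [[0, 1], [1, 0]]
    print(length_of_subspace([e11, b]))  # (2, [2, 4])
```

For the pair `e11`, `b` the chain has dimensions 2 and 4, so this two-dimensional subspace of the $2 \times 2$ matrices has length 2.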

Corollary 2.3 (Generic version of Paz's conjecture). Let $g \ge 2$ and let $\mathbb{F}$ be an infinite field.
There exists a nonempty Zariski open subset $S \subseteq M_n(\mathbb{F})^g$ such that the length of an $S$-generic
vector subspace of $M_n(\mathbb{F})$ is of order $O(\log n)$.

Note that if $\mathbb{F} \in \{\mathbb{R}, \mathbb{C}\}$, then a nonempty Zariski open subset of $\mathbb{F}^n$ is automatically dense
in the Euclidean topology.

2.2.2. Words in random matrices span the full matrix algebra. By Corollary 2.3, given a
$g$-tuple $A$ of random $n \times n$ matrices, words of length $O(\log n)$ in $A$ span $M_n(\mathbb{F})$. In particular,
we have:

Corollary 2.4. For each $g$ satisfying $n^2 \le g^{2d}$ there exists a set of $g$ matrices such that words
of length $2d$ in those matrices span $M_n(\mathbb{F})$.
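
Corollary 2.4 can be checked numerically in small cases by vectorizing all words of a given length and computing the rank of the resulting matrix. The following is a minimal sketch in exact rational arithmetic. The function names are hypothetical, and the sample matrices are one hand-picked symmetric pair for $n = g = 2$, $d = 1$, not a construction taken from the text.

```python
from fractions import Fraction
from itertools import product

def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def evaluate_word(word, mats):
    """Evaluate a word (tuple of letter indices) at the tuple of matrices."""
    n = len(mats[0])
    result = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for letter in word:
        result = mat_mul(result, mats[letter])
    return result

def rank(rows):
    """Rank of a list of vectors via exact Gauss-Jordan elimination."""
    rows = [[Fraction(x) for x in r] for r in rows]
    rk = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        pivot = next((r for r in range(rk, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(len(rows)):
            if r != rk and rows[r][col] != 0:
                factor = rows[r][col] / rows[rk][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rk])]
        rk += 1
        if rk == len(rows):
            break
    return rk

def words_span_dimension(mats, length):
    """Dimension of the span of all words of exactly `length` in `mats`."""
    vectors = []
    for word in product(range(len(mats)), repeat=length):
        value = evaluate_word(word, mats)
        vectors.append([entry for row in value for entry in row])
    return rank(vectors)

if __name__ == "__main__":
    a = [[1, 0], [0, 0]]
    b = [[0, 1], [1, 0]]
    print(words_span_dimension([a, b], 2))  # 4, i.e. the words span all 2x2 matrices
```

The same function can be applied to randomly chosen tuples for larger $n$ and $d$ with $g^{2d} \ge n^2$; the corollary predicts dimension $n^2$ for a generic choice.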

Corollary 2.4 partially answers a question posed in [Ros12], where this result is established
in the case $d = 1$. The answer is complete for $n \in \mathbb{N}$ satisfying $g^{2d-1} < n^2 \le g^{2d}$. The question
whether the words of length $2d - 1$ sweep $M_n(\mathbb{F})$ in the case $g^{2d-2} < n^2 \le g^{2d-1}$ remains.

3. Proofs 

The proof of Theorem 2.2 reduces to the study of a special kind of graphs which we introduce 
in the first subsection. The main ideas can be revealed already in the case of two variables, 
and the general case is only notationally more difficult, so we focus on $g = 2$. For convenience
we state this special case separately in the next proposition.

Proposition 3.1. Let $d = \lceil \log_2 n \rceil$. There exist $M_n(\mathbb{F})$-locally linearly independent words
$w_1,\dots,w_{n^2} \in \langle x,y\rangle_{2d}$.

3.1. Graphs. We recursively construct a family of graphs $\{G_d\}_{d \in \mathbb{N}}$. Let $G_0$ be a graph with
one vertex labeled by 1 and no edges. We let $G_d$ be a (directed) graph with $2^d$ vertices
labeled by $\mathbb{N}_{2^d}$, and define the edges as follows. There is a (directed) edge from $i$ to $j$ in $G_d$
of multiplicity $4e$, $1 \le i,j \le 2^{d-1}$, if there is an edge of multiplicity $e$ from $i$ to $j$ in $G_{d-1}$.
Moreover, each vertex $i$, $1 \le i \le 2^{d-1}$, has additionally $2^{d+1}$ loops, and there are $2^d$ edges
from $i$ to $i + 2^{d-1}$ and back for $1 \le i \le 2^{d-1}$. We label the loops by $x$, and other edges by $y$.
Note that $G_d$ contains $2^{2d+1}d$ edges, half of them labeled by $x$ and the other half by $y$.
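
The recursive construction can be sketched in a few lines, which also lets one confirm the edge count for small $d$. In this minimal sketch the multigraph is stored as a map from (source, target, label) to multiplicity; the names `build_graph` and `edge_counts` are hypothetical.

```python
from collections import Counter

def build_graph(d):
    """Return the graph of level d as a Counter mapping
    (source, target, label) -> multiplicity, following the recursion:
    inherited edges get multiplicity times 4, then new loops and
    new edges to the upper half of the vertices are added."""
    edges = Counter()  # level 0: one vertex, no edges
    for level in range(1, d + 1):
        half = 2 ** (level - 1)
        # Edges inherited from the previous level, multiplicity times 4.
        new_edges = Counter({key: 4 * mult for key, mult in edges.items()})
        for i in range(1, half + 1):
            new_edges[(i, i, "x")] += 2 ** (level + 1)    # additional loops
            new_edges[(i, i + half, "y")] += 2 ** level   # edges i -> i + 2^(level-1)
            new_edges[(i + half, i, "y")] += 2 ** level   # and back
        edges = new_edges
    return edges

def edge_counts(d):
    """Return (total, labeled x, labeled y) edge counts with multiplicity."""
    graph = build_graph(d)
    total = sum(graph.values())
    x_edges = sum(m for (_, _, label), m in graph.items() if label == "x")
    return total, x_edges, total - x_edges

if __name__ == "__main__":
    for d in range(1, 4):
        print(d, edge_counts(d), d * 2 ** (2 * d + 1))
```

For $d = 1, 2, 3$ the totals are 8, 64 and 384, matching $2^{2d+1}d$, with the $x$ and $y$ labels splitting evenly.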

For example, the figures below show $G_1$ and $G_2$ with the numbers on edges corresponding
to their respective multiplicities, and instead of the labels $x$ and $y$ we use dashed (resp. solid)
edges for the edges corresponding to $x$ (resp. $y$).


Figure 1. $G_1$

Figure 2. $G_2$


If p is a walk in the graph then we associate to it a word corresponding to the labels on 
the edges passed by p in the respective order. 

We now state a technical lemma that will be used extensively in the proof of Proposition 3.1.

Lemma 3.2. There is a unique way of partitioning the graph $G_d$ into $2^{2d}$ (edge-disjoint) walks
$p_{ij}$ of length $2d$, $1 \le i,j \le 2^d$, such that $p_{ij}$ starts at $i$ and ends at $j$, which yield all the words
in $\langle x,y\rangle_{2d}$.

Proof. We prove the lemma by induction on $d$. Let us denote by $G_d^{(m)}$ the graph obtained
from $G_d$ by multiplying the multiplicity of each edge by $m$.

We claim that $G_d^{(m)}$ can be partitioned in only one way into $m2^{2d}$ walks of length $2d$ such
that $m$ walks start at $i$ and end at $j$, $1 \le i,j \le 2^d$, and such that each word in $\langle x,y\rangle_{2d}$
corresponds to $m$ walks. Consider first $G_1^{(m)}$. Then the only way of obtaining the desired
partition is to take $m$ walks $\{2 \to 1, 1 \to 2\}$, $m$ walks $\{1 \to 1, 1 \to 2\}$, $m$ walks $\{2 \to 1, 1 \to 1\}$
and $m$ walks $\{1 \to 1, 1 \to 1\}$, as can be easily seen. Suppose that the claim holds for all
graphs $G_i^{(m)}$, $i < d$. Consider now $G_d^{(m)}$. Since there are no loops on the vertices labeled by $i$,
$2^{d-1} + 1 \le i \le 2^d$, all walks with the starting or ending point in these vertices need to begin,
respectively end, with $y$. By the condition on the partition exactly half of the walks have
this property, thus walks with other starting, resp. ending, point need to begin, resp.
end, with $x$. Removing the edges starting or ending at $i$, $2^{d-1} + 1 \le i \le 2^d$, and the $m2^{d+1}$
additional loops on vertices $i$, $1 \le i \le 2^{d-1}$, we obtain a graph on $2^{d-1}$ vertices labeled by
$\mathbb{N}_{2^{d-1}}$ (ignoring the isolated points) which coincides with $G_{d-1}^{(4m)}$ by construction, and which
we need to partition into $m2^{2d}$ walks of length $2(d - 1)$ (as we have already removed the
starting and the ending edge of walks in $G_d^{(m)}$) such that $4m$ walks start at $i$ and end at $j$,
$1 \le i,j \le 2^{d-1}$, and each word in $\langle x,y\rangle_{2d-2}$ corresponds to $4m$ walks. By the induction
hypothesis, there is only one such partition. The lemma thus follows by taking $m = 1$. ■
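
The base case of the induction can be confirmed by brute force. The following is a minimal sketch that enumerates all ways of assigning the four words of length 2 to the four pairs $(i,j)$ in the graph $G_1$ and counts those assignments whose walks use every edge exactly as often as its multiplicity. Parallel edges are treated as indistinguishable, which is our assumption about how partitions are counted; the names `walks` and `count_partitions` are hypothetical.

```python
from collections import Counter
from itertools import permutations, product

# Level-1 graph: 4 loops labeled x at vertex 1, and 2 edges labeled y
# in each direction between vertices 1 and 2.
G1 = Counter({(1, 1, "x"): 4, (1, 2, "y"): 2, (2, 1, "y"): 2})

def walks(graph, start, word):
    """All walks from `start` spelling `word`, as (end_vertex, edges)."""
    partial = [(start, ())]
    for letter in word:
        step = []
        for vertex, used in partial:
            for (src, dst, label) in graph:
                if src == vertex and label == letter:
                    step.append((dst, used + ((src, dst, label),)))
        partial = step
    return partial

def count_partitions(graph, vertices, length):
    """Count partitions of `graph` into walks p_ij (one per ordered pair)
    of the given length whose words are pairwise distinct, hence exhaust
    all words of that length in x and y."""
    words = ["".join(w) for w in product("xy", repeat=length)]
    pairs = [(i, j) for i in vertices for j in vertices]
    count = 0
    for assignment in permutations(words):
        options = []
        for (i, j), word in zip(pairs, assignment):
            options.append([edges for end, edges in walks(graph, i, word)
                            if end == j])
        for choice in product(*options):
            usage = Counter(e for walk in choice for e in walk)
            if usage == graph:
                count += 1
    return count

if __name__ == "__main__":
    print(count_partitions(G1, [1, 2], 2))  # 1
```

The single partition found assigns $xx$, $xy$, $yx$, $yy$ to the pairs $(1,1)$, $(1,2)$, $(2,1)$, $(2,2)$, which is the case $m = 1$ of the partition listed in the proof. The enumeration is factorial in the number of words, so it is only meant for $d = 1$.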

For the proof of Theorem 2.2 we will need a slight generalization of the previous lemma. 
We thus introduce a graph $G_d^g$ which has $g^d$ vertices and is defined recursively by setting $G_0^g$
to be the graph with 1 vertex labeled by 1 and no edges. Having constructed $G_{d-1}^g$ we let $G_d^g$
be a directed graph with vertices labeled by $\mathbb{N}_{g^d}$ and having a (directed) edge from $i$ to $j$
of multiplicity $g^2 e$, $i,j \in \mathbb{N}_{g^{d-1}}$, if there is an edge of multiplicity $e$ from $i$ to $j$ in $G_{d-1}^g$, which
is labeled as the corresponding edge in $G_{d-1}^g$, and there are $g^d$ edges from $i$ to $i + (k - 1)g^{d-1}$
and back for $1 \le i \le g^{d-1}$, labeled by $x_k$, $1 \le k \le g$ (for $k = 1$ every loop has multiplicity $2g^d$).
Note that $G_d^g$ contains $2dg^{2d}$ edges.

Lemma 3.3. There is a unique partition of the graph $G_d^g$ into $g^{2d}$ (edge-disjoint) walks $p_{ij}$ of
length $2d$, $1 \le i,j \le g^d$, such that $p_{ij}$ starts at $i$ and ends at $j$, which yield all the words in
$\langle x_1,\dots,x_g\rangle_{2d}$.

The proof of Lemma 3.3 is a straightforward modification of the proof of Lemma 3.2 and is omitted.

3.2. Proof of Theorem 2.2. 


Proof of Proposition 3.1. By Lemma 2.1 we need to show that

\[ p(x_{11},\dots,x_{nn},y_{11},\dots,y_{nn}) := \Delta\big(w_1(X,Y),\dots,w_{n^2}(X,Y)\big) \]

is nonzero for some $w_1,\dots,w_{n^2} \in \langle x,y\rangle_{2d}$, where $X = (x_{ij})$, $Y = (y_{ij})$ are generic $n \times n$
matrices. (We may and we will assume that $X$ is diagonal [Row80, Proposition 1.3.15].) By
the definition of the discriminant,

\[ p(x_{11},\dots,x_{nn},y_{11},\dots,y_{nn}) = \sum_{\sigma \in S_{n^2}} (-1)^{\sigma} \prod_{1 \le i,j \le n} w_{\sigma(k_{ij})}(X,Y)_{ij}, \tag{3.1} \]

where $k_{ij} = (i - 1)n + j$, and $w_k(X,Y)_{ij}$ denotes the commutative polynomial at the entry
$(i,j)$ of the word $w_k$ evaluated at the generic matrices $X, Y$.

Let us define the lexicographic order on $\langle x,y\rangle$ with $x > y$ and denote by $v_s$ the vector of
the words of length $s$ listed decreasingly with respect to this order. Let

\[ a^{(n)} = e_n v_d v_d^t e_n =
\begin{pmatrix}
x\, v_{d-1} v_{d-1}^t\, x & x\, v_{d-1} v_{d-1}^t\, y\, e_{n'} \\
e_{n'}\, y\, v_{d-1} v_{d-1}^t\, x & e_{n'}\, y\, v_{d-1} v_{d-1}^t\, y\, e_{n'}
\end{pmatrix} \tag{3.2} \]

be the block matrix consisting of words, where $n' = n - 2^{d-1}$ and $e_{n'} = e_{11} + \cdots + e_{n'n'}$, with
blocks of the size $2^{d-1} \times 2^{d-1}$, $2^{d-1} \times n'$, $n' \times 2^{d-1}$, and $n' \times n'$, respectively.

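We proceed to find a monomial that appears in the product on the right-hand side of (3.1)
for a unique $\sigma \in S_{n^2}$. Let $2^{d-1} < n \le 2^d$ and $x_i = x_{ii}$. We define

\[ \tilde m_n^{(ij)} =
\begin{cases}
x_i x_j & \text{if } 1 \le i,j \le 2^{d-1}, \\
x_i\, y_{j-2^{d-1},\,j} & \text{if } 1 \le i \le 2^{d-1},\ 2^{d-1} < j \le n, \\
y_{i,\,i-2^{d-1}}\, x_j & \text{if } 2^{d-1} < i \le n,\ 1 \le j \le 2^{d-1}, \\
y_{i,\,i-2^{d-1}}\, y_{j-2^{d-1},\,j} & \text{if } 2^{d-1} < i,j \le n.
\end{cases} \]

We further inductively define

\[ m_n^{(ij)} = \tilde m_n^{(ij)}\, m_{2^{d-1}}^{(i_{d-1} j_{d-1})}, \qquad m_1^{(11)} = 1, \]

where $i_{d-1} \equiv i \bmod 2^{d-1}$, $j_{d-1} \equiv j \bmod 2^{d-1}$, $1 \le i_{d-1}, j_{d-1} \le 2^{d-1}$. Consider the monomial defined
by

\[ m_n = \prod_{1 \le i,j \le n} m_n^{(ij)}. \]

In particular, in the case $n = 2^d$ we have

\[ m_{2^d} = (m_{2^{d-1}})^4 \prod_{1 \le i,j \le 2^d} \tilde m_{2^d}^{(ij)}. \]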

Let $w_{(i-1)n+j} = a^{(n)}_{ij}$ for $a^{(n)}$ defined in (3.2). We claim that $m_n$ appears in

\[ p_n^{\sigma} = \prod_{1 \le i,j \le n} w_{\sigma(k_{ij})}(X,Y)_{ij} \]

only for $\sigma = \mathrm{id}$. By the construction of $m_n^{(ij)}$, $m_n$ and (3.2), $m_n^{(ij)}$ has a nonzero coefficient in
the commutative polynomial $a^{(n)}_{ij}(X,Y)_{ij}$ and thus the same holds for the monomial $m_n$ in

\[ \prod_{1 \le i,j \le n} w_{k_{ij}}(X,Y)_{ij} = \prod_{1 \le i,j \le n} a^{(n)}_{ij}(X,Y)_{ij}. \]

It remains to show that $m_n$ does not appear in $p_n^{\sigma}$ for $\sigma \ne \mathrm{id}$. For this we use a graph-theoretic
language.

We first consider the case $n = 2^d$. We can represent the monomial $m_n$ by a graph
on $n$ vertices, in which there is a directed edge of multiplicity $s_{ij}$ between vertices $i$ and $j$
labeled by $y$ if $s_{ij}$ is the degree of $y_{ij}$ in the monomial $m_n$, and there are $s_i$ loops on the vertex
$i$ labeled by $x$ if $s_i$ is the degree of $x_i$ in $m_n$. By construction this graph is $G_d$. Since $m_n$ needs
to be written as a product of monomials $u_{ij}$, $1 \le i,j \le n$, arising from monomials in
$w_{\sigma(k_{ij})}(X,Y)_{ij}$, $w_{k_{ij}} \in \langle x,y\rangle_{2d}$, our problem reduces to finding partitions of the graph
associated to $m_n$ into walks $p_{ij}$, $1 \le i,j \le n$, of length $2d$ that yield all the words in $\langle x,y\rangle_{2d}$.
Lemma 3.2 asserts that there is only one such partition, and thus concludes the proof in the
case $n = 2^d$.

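For arbitrary $n$ we observe that if

\[ \sum_{\sigma \in S_{n^2}} (-1)^{\sigma} \prod_{1 \le i,j \le n} w_{\sigma(k_{ij})}(X,Y)_{ij} \]

equaled 0 then $m_n$ would appear in $p_n^{\mathrm{id}}$ and in some other $p_n^{\rho}$ with $\rho \in S_{n^2}$. As $m_n^{(ij)}$ can
be identified with $m_{2^d}^{(ij)}$ for $1 \le i,j \le n$, $m_{2^d}$ would have a nonzero coefficient in the product
$p_{2^d}^{\sigma}$ for $\sigma = \mathrm{id}$ and $\sigma = \tilde\rho \in S_{2^{2d}}$ (here $\tilde\rho$ is the permutation in $S_{2^{2d}}$ induced by $\rho$ and fixing all
other indices), which is impossible by the claim proved in the previous paragraph. Thus the words
$a^{(n)}_{11},\dots,a^{(n)}_{nn}$ are $M_n(\mathbb{F})$-locally linearly independent. ■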


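Proof of Theorem 2.2. One follows the steps of the proof of Proposition 3.1, where initially one
needs to consider the case $n = g^d$ and defines the monomial $m_n$ inductively corresponding to
the matrix

\[ a^{(n)} = e_n v_d v_d^t e_n =
\begin{pmatrix}
x_1 v_{d-1} v_{d-1}^t x_1 & \cdots & x_1 v_{d-1} v_{d-1}^t x_{g-1} & x_1 v_{d-1} v_{d-1}^t x_g e_{n'} \\
\vdots & \ddots & \vdots & \vdots \\
x_{g-1} v_{d-1} v_{d-1}^t x_1 & \cdots & x_{g-1} v_{d-1} v_{d-1}^t x_{g-1} & x_{g-1} v_{d-1} v_{d-1}^t x_g e_{n'} \\
e_{n'} x_g v_{d-1} v_{d-1}^t x_1 & \cdots & e_{n'} x_g v_{d-1} v_{d-1}^t x_{g-1} & e_{n'} x_g v_{d-1} v_{d-1}^t x_g e_{n'}
\end{pmatrix} \tag{3.3} \]

where $g^{d-1} < n \le g^d$, $n' = n - g^{d-1}$, and $v_s$ denotes the vector of $g^s$ words of length $s$ listed
decreasingly in the monomial order induced by setting $x_1 > \cdots > x_g$. Instead of applying
Lemma 3.2 one concludes the proof by applying Lemma 3.3. ■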

Proof of Corollary 2.3. Let $w_1,\dots,w_{n^2}$ be the $M_n(\mathbb{F})$-locally linearly independent words of
length $2d = 2\lceil \log_g n \rceil$ whose existence was established in Theorem 2.2. As then the discriminant
$\Delta := \Delta(w_1(X_1,\dots,X_g),\dots,w_{n^2}(X_1,\dots,X_g))$ is nonzero, the subset $S$ of $A \in M_n(\mathbb{F})^g$
where $\Delta$ does not vanish is a nonempty Zariski open subset, and therefore dense in $M_n(\mathbb{F})^g$.
In the case $\mathbb{F} \in \{\mathbb{R}, \mathbb{C}\}$, $S$ is also dense in the Euclidean topology. By the definition of $S$ it
follows that every $S$-generic vector subspace of $M_n(\mathbb{F})$ is of length $O(\log n)$. ■


Proof of Corollary 2.4. Choose $A_1,\dots,A_{\hat g}$ such that $(A_1,\dots,A_{\hat g}) \in S \subseteq M_n(\mathbb{F})^{\hat g}$, where $\hat g$ is
the smallest among $m \in \mathbb{N}$ satisfying $m^d \ge n$, and $S$ is the set described in the proof of
Corollary 2.3. For the remaining $A_{\hat g+1},\dots,A_g$ arbitrary $n \times n$ matrices will do. ■


Remark 3.4. (a) It is not difficult to see that one can take the matrices in Corollary 2.4 to be 
symmetric. One only needs to note that the proof of Lemma 3.3 also works for the undirected 
version of the graphs and then use these in the proof of Theorem 2.2. 

(b) The proof of Proposition 3.1 also leads to an explicit construction of $n \times n$ matrices such
that words of degree $2d = 2\lceil \log_g n \rceil$ in these matrices span $M_n(\mathbb{F})$. We give an example in
characteristic 0. Keep the notation from the proof of Corollary 2.4. Let M = n!(n^'^“^)”’. We 

set all variables that do not appear in $m_n$ to zero, and denote by $C$ the polynomial algebra in

ik) il) 

the remaining variables. Let us order the variables as follows: 

(resp. ^ (^^sp. {s,k,i) < in the 

lexicographic order, and take the corresponding lexicographic ordering on C. Let 

d 

Cl = 3, Cs = 2f~^{g-l)+f~‘^{s>l), c = ^Cs. 

s=l 

We further dehne for 5 *“^ < * < and 1 < j < g^~^, 

S—1 

fl,s,i+ = = ^Ct+j <j< f~^), 

t=0 

s—1 s—1 

fk,s,j+ = ^ ct + 9^ ^ + 25 ® ^ (A; — 2) -|- j, fk,s,j- = ^ ct + 9^ ^ + 2^^ ^{k — 2) + ^ j. 

t=o t=o 


We set 


Aik) 




(k) 




Since the monomial $m_n$ is the maximal monomial in $C$, the degree of monomials appearing
in $\Delta(w_1(X_1,\dots,X_g),\dots,w_{n^2}(X_1,\dots,X_g))$ is $2dn^2$, and there appear at most $M$ monomials in
$\Delta$ (counted with multiplicity). It is easy to see that the constructed $A^{(k)}$, $1 \le k \le \hat g$, and
arbitrary $A^{(k)}$, $\hat g < k \le g$, have the desired property of Corollary 2.4.

(c) It would be interesting to know whether arbitrary words in $x, y$ of fixed length
$d \ge \lceil 2\log_2 n \rceil$ sweep $M_n(\mathbb{F})$. If the answer were positive then we could deduce that a
quasi-identity of $M_n(\mathbb{F})$ (see [BPS15] for the definition) $\sum_M \lambda_M M$ with $\deg M = d$ cannot be a sum
of fewer than monomials, and this bound is sharp. This should be seen in contrast with 
[Row80, Exercise 7.2.3], stating that a multilinear polynomial identity of $M_n(\mathbb{F})$ cannot be a
sum of fewer than $2^n$ monomials. However, the sharp bound is to the best of our knowledge
not known.